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Arabidopsis thaliana is a model plant species and its molecular 
dissection has greatly contributed to our understanding of 
the systems preventing genome invasion by transposable 
elements (TE). Recent advances suggest that A. thaliana may 
be more efficient than its congener A. lyrata at controlling TE 
expression and proliferation. The comparative analysis of TE 
transcription in A. thaliana and A. lyrata, which differ by 40% in 
genome size, may help understand how silencing mechanisms 
contribute to the evolution of transposition rate, an important 
factor controlling genome size variation in plants and animals. 



The number of transposable elements (TEs) is known to vary 
greatly among plant genomes and evolves as the net outcome of 
three major factors: transposition activity, TE removal and effi- 
ciency of purifying selection as determined by population genet- 
ics processes. 1 " 3 In the genus Arabidopsis, the two species A. lyrata 
and A. thaliana differ by -40% in genome size. 4,5 As much as 
56% of the A. lyrata genome sequence that cannot be aligned to 
A. thaliana encodes TEs or simple repeats. 5 Differences in TE 
content therefore explain a significant fraction of genome size dif- 
ferences. Out-crossing species are expected to be more efficient in 
removing deleterious mutations than inbred species. 1 As a con- 
sequence, new transposition variants with deleterious effects on 
plant fitness should segregate at lower frequency in the species 
that can best purify deleterious mutations, i.e., in the outbred 
A. lyrata. However, transposition variants are not segregating at 
lower frequency in A. lyrata. 6,1 Therefore, population dynamics 
of deleterious mutations do not influence significantly the differ- 
ence shown by A. thaliana and A. lyrata in TE content. 2 

Instead, evidence is accumulating that the two species differ 
in the transcriptional control of TEs. In the model plant spe- 
cies A. thaliana, this control is increasingly well understood. 8 " 11 
Silencing of TEs is mediated by small interfering RNAs (siR- 
NAs) that guide the deposition of DNA and histone methylation 
marks on homologous DNA stretches. 12 " 14 These marks then act 
in cis- to repress expression. In A. thaliana, most TEs reside in 
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pericentromeric chromosomal regions that are transcriptionally 
inactive. 15,16 Under stressful conditions, silencing can be tran- 
siently suppressed, making room for potential transposition. 17,18,19 
Knocking-out distinct components of the transcriptional gene 
silencing system can also activate specific subsets of TEs. 15,20 The 
analysis of small RNAs expressed in both species showed that 
A. thaliana appears to be able to repress TE transcription more 
efficiently than A. lyrata. The specificity of siRNAs for their tar- 
get TEs is strongly associated with the efficacy of TE silencing 
in both A. lyrata and A. thaliana, but A. thaliana shows a 3-fold 
greater proportion of siRNAs mapping uniquely to a single TE 
copy. 21 This finding agrees with the observation that a greater 
number of TEs are expressed in A. lyrata. 2 ^ In A thaliana, genes 
located less than 2.5 kb away from TEs tend to be less expressed, 
suggesting that silencing TEs can entail negative consequences 
for plant fitness if protein-coding genes in the vicinity of TE 
insertions are partially silenced and cannot be expressed prop- 
erly. 22 This cost seems weaker in the congeneric species A. lyrata 
where TEs are more abundant and tend to be closer to expressed 
genes. Indeed, in this species, TE insertions are associated with a 
reduced expression of their closest protein-coding neighbor only 
if they are located less than 1.5 kb away. 21 Interestingly, protein- 
coding genes showing greater expression of the A. lyrata allele in 
Fl interspecific hybrids confirm that the silencing effect of TE 
proximity is greater in A. thalianaP 

Using a novel approach to monitor genome-wide variation in 
TE aV-regulation, He et al. in the de Meaux laboratory have dem- 
onstrated major interspecific differences in the rw-regulation of 
TE silencing. 24 In Fl interspecific hybrids, monitoring of allele- 
specific expression at 1535 loci annotated as TEs in A. thaliana 
showed that as many as 47% of TE loci display allele-specific 
expression and thus differ in their cw-regulation. Interestingly, 
almost all differentially expressed TEs expressed the A. lyrata 
allele more than the A. thaliana allele. Allele-specific expression 
of TEs is not observed between A. thaliana accessions. 25 He, 
de Meaux and collaborators 24 confirmed on a sub-sample of 18 
loci that the upregulation of A. lyrata alleles in Fl interspecific 
hybrids reflects parental differences in TE expression, and is not 
simply a result of TE upregulation induced by "genomic shock," 
a phenomenon often observed in interspecific hybrids or syn- 
thetic Arabidopsis allopolyploids. 26 A previous analysis based on 
RNA-sequencing data reported that 8% of A lyrata TE loci are 
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expressed, 21 but the study of He et al. 24 might be more sensitive 
as the approach is not affected by coverage issues that prevent the 
detection of lowly expressed genes. Importantly, this study fur- 
ther supports the hypothesis proposed by Hollister et al. 21 that 
interspecific differences in TE regulation depend on cw-acting 
marks deposited by the epigenetic machinery. For ten of 1 1 TEs 
examined, H3K9me2 methylation, a silencing histone mark, was 
detected in A. thaliana but not in A. lyrata?* In addition, dif- 
ferentially regulated TEs were significantly enriched among TEs 
controlled by the DNA methyl-transferase METl, suggesting 
that allele-specific expression of TEs depends on CG methyla- 
tion. 24,27 Since A. lyrataTE alleles are systematically upregulated, 
it is possible that unknown changes in the epigenetic machinery 
of the two species create genome-wide differences in epigenetic 
marks, which are maintained in interspecific Fl hybrids" and 
act in cis- to preferentially silence the A. thaliana allele. We sus- 
pect that interspecific differences in the epigenetic machinery 
are mostly quantitative: A. lyrata METl shares 90% amino 
acid identity with its A. thaliana ortholog (Pecinka A, unpub- 
lished data) and appears functional. In addition, A. thaliana and 
A. lyrata seem to have the same repertoire of epigenetic marks, 24 ' 28 
although the A. lyrata epigenome has not yet been characterized. 
Further experiments are warranted to confirm this hypothesis. 
In fact, sequence differences in TE regulatory regions probably 
also contribute to methylation differences between TE alleles. 
The endosperm expressed-gene FWA, for example, shows inter- 
specific variation in patterns of epigenetic silencing likely driven 
by the presence of repeat sequences in the promoter 28 and dif- 
ferential TE regulation associates with nucleotide differences in 
TE upstream regions. 24 

TE transcriptional upregulation is sometimes associated 
with increased transposition rates. 2 Do the cis-regulatory dif- 
ferences observed for Arabidopsis TEs 24 reflect a difference 
in transposition activity? The genomic sequence of A. lyrata 
revealed that recently inserted TEs were more abundant in 
A. lyrata than in A. thaliana? ' 6,29 Therefore, the genome size 
difference between the two species is not only due to a massive 
removal of TEs in A. thaliana: TE amplification in A. lyrata 
also plays a role. He et al. 24 found that TEs showing differences 
in cw-regulation do not show a comparatively higher number 
of copies. Using published age estimates, 5 they observed that 
LTR retrotransposons differentially expressed in Fl hybrids 
are among the youngest TEs in the A. thaliana genome. This 
suggests that A. thaliana silences recent TE copies more effec- 
tively, although older copies are similarly regulated in both spe- 
cies, a finding in agreement with the interspecific differences 
in uniquely mapping siRNAs mentioned above. 21 Nonetheless, 
TE age was estimated based on the divergence of the termi- 
nal repeats in LTR retrotransposons, which are identical at 
the time of insertion. 30 We observed that age estimates are 
not congruent between species. Orthologous LTR retrotrans- 
posons, defined as elements flanked by orthologous neighbor 



genes, were estimated to be older in A. lyrata. Since orthol- 
ogous TE insertions should be the same age, this questions 
the accuracy of LTR age estimation. Population parameters, 
which differ widely between outcrossing A. lyrata and selfing 
A. thaliana?' probably influence the rate of LTR divergence, 
making interspecific TE age comparisons difficult. In addition, 
age estimates would be flawed if LTRs undergo gene conver- 
sion. 30 To our knowledge, the role of this potential confound- 
ing factor has not been considered in these two species. The 
age distribution of LTR retrotransposons in the two species 
therefore needs to be re-examined in greater depth. The exis- 
tence of inter-specific differences in transcriptional silencing of 
TEs is now clear but whether they have caused the widely dif- 
ferent TE contents of the genomes of A. lyrata and A. thaliana 
remains to be demonstrated. 

Interestingly, not only the host silencing system is evolving in 
the Arabidopsis genus: the elements themselves have evolved new 
functions that may also contribute to interspecific differences in 
TE proliferation. A groundbreaking study has demonstrated that 
orthologous families of TEs can evolve different transposition site 
preferences. Tsukahara et al. 29 introduced Tall, an A. lyrata mem- 
ber of the COPIA93 1 Evade TE family, 15,16 into the A. thaliana 
genome and monitored its activity. In A. lyrata, this TE recently 
proliferated in centromeric regions. In contrast, COPIA93 mem- 
bers in A. thaliana are found in low copy numbers and insert in 
chromosome arms. When introduced into A thaliana, Tall is not 
efficiently silenced, and proliferates by integrating specifically in 
the centromeres. The rate of Tall transposition is magnified in a 
ddml mutant background, suggesting that Tall does not entirely 
escape the A. thaliana defense system. 29 The release of COPIA93 
silencing in A. thaliana ddml mutants leads to new integrations 
in chromosome arms but never in centromeres. Preferential inser- 
tion into centromeric regions seems to be an ancestral property 
of Tall and thus COPIA93 shows a modified insertion site pref- 
erence in the A, thaliana lineage. This experiment provides an 
admirable demonstration that TEs have evolved new insertion 
abilities since the separation of A. lyrata and A. thaliana. 

The rich molecular and genomic toolbox available in A. thali- 
ana and the availability of the A. lyrata genome should facilitate 
the identification of molecular factors controlling both host and 
invader variation. Interspecific variation in the Arabidopsis genus 
therefore promises to bring novel insights into the mechanisms 
controlling the evolution of both "selfish DNA" and its control 
by the host's genomic system. It also promises to shed light on 
a long-standing question: whether selfish DNA and the host 
defense system coevolve. 
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